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Abstract 

The constraint complexity of a graphical realization of a linear code is the maximum dimension 
of the local constraint codes in the realization. The treewidth of a linear code is the least constraint 
complexity of any of its cycle-free graphical realizations. This notion provides a useful parametriza- 
tion of the maximum-likelihood decoding complexity for linear codes. In this paper, we prove the 
| surprising fact that for maximum distance separable codes and Reed-Muller codes, treewidth equals 

{NJ ■ trelliswidth, which, for a code, is defined to be the least constraint complexity (or branch complex- 

ity) of any of its trellis realizations. From this, we obtain exact expressions for the treewidth of these 
codes, which constitute the only known explicit expressions for the treewidth of algebraic codes. 

hH ! 

o ; 1 Introduction 

£T) \ A (normal) graphical realization of a linear code C consists of an assignment of the coordinates of C 

to the vertices of a graph, along with a specification of linear state spaces and linear "local constraint" 
^ . codes to be associated with the edges and vertices, respectively, of the graph |@). Cycle-free graphical 

O- \ realizations, or simply tree realizations, are those in which the underlying graph is a tree. Tree realiza- 

tions of linear codes are interesting because the sum-product algorithm (SPA) on such a realization is an 
. exact implementation of maximum-likelihood (ML) decoding ifToll . The notion of constraint complexity 

of a tree realization was introduced by Forney |]5] as a measure of the computational complexity of the 
corresponding SPA algorithm. It is defined to be the maximum dimension among the local constraint 
codes constituting the realization. The treewidth of a linear code is the least constraint complexity of 
any of its tree realizations. 

The minimal tree complexity measure defined for linear codes by Halford and Chugg is a close 
relative of treewidth. There are also closely related notions of treewidth defined for graphs and 
matroids (7]; these relationships are discussed in more detail in ifTOll . Known facts about the treewidth 
of graphs and matroids imply that computing the treewidth of a code is NP-hard. 

For a length-n linear code over the field ¥ q , the computational complexity of implementing ML 
decoding, via the SPA on an optimal tree realization, is 0(nq l ), where t is the treewidth of the code 
iTTOTl . In particular, ML decoding is fixed-parameter tractable with respect to treewidth, which means 
that for codes whose treewidth is bounded by a fixed constant t, ML decoding can be performed in 
polynomial time. Thus, treewidth provides a useful parametrization of ML decoding complexity. 

Trellis representations (or trellis realizations) of codes are special cases of tree realizations which 
have received extensive attention in the literature (see e.g., liT4l ). In the context of trellis representations, 
constraint complexity is usually called branch complexity. We define here the trelliswidth of a code to 
be the least branch complexity of any of its trellis representations (optimized over all possible orderings 
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of the coordinates of the code). As trellis representations are instances of tree realizations, trelliswidth 
is at least as large as treewidth. In fact, it is known that trelliswidth can be much larger than treewidth: 
it was shown in ifTTTl that the ratio of trelliswidth to treewidth can grow at most logarithmically with 
blocklength, and that there are codes with arbitrarily large blocklengths that achieve this logarithmic 
growth rate. The only known code family achieving logarithmic growth rate of this ratio is a family 
consisting of cut-set codes of a certain class of graphs. The codes in this family all have treewidth equal 
to 2, and rate approximately 1/4, but minimum distance only 4 ifTOl . 

It is not known if there are any other code families for which there is a significant advantage to 
be gained in going from trellis representations to tree realizations that are topologically more complex. 
In the only previous investigation reported on this question, Forney considered the family of Reed- 
Muller codes. He showed that for a certain natural tree realization of Reed-Muller codes, obtained 
from their well-known recursive \u\u + v\ construction, the constraint complexity is, in general, strictly 
larger than the trelliswidth of the code. But this still leaves open the possibility that there may be other 
tree realizations whose constraint complexity beats trelliswidth. In particular, it leaves undecided the 
question of whether the treewidth of a Reed-Muller code can be strictly less than its trelliswidth. 

In this paper, we show that for Reed-Muller codes, treewidth is equal to trelliswidth. The proof of 
this makes use of structural properties known for optimal trellis realizations of Reed-Muller codes, and 
also relies strongly on a certain separator theorem for trees. A similar proof strategy also works on the 
much simpler case of maximum distance separable (MDS) codes, where again we show that treewidth 
equals trelliswidth. These results yield the first explicit expressions for the treewidth of classical alge- 
braic codes. 

The rest of this paper is organized as follows. After providing the necessary definitions and notation 
in Section |2 we describe, in Section [3] our proof strategy for showing that treewidth equals trelliswidth 
for certain codes. Sections |4] and [5] deal with MDS and Reed-Muller codes, respectively. The technical 
details of some of the proofs are given in appendices. 

2 Preliminaries and Notation 

The notation [n] denotes the set of positive integers from 1 to n; [a,b] denotes the set {i € Z : a < 
i < b}. An (n, k) linear code is a code of length n and dimension k. The n coordinates of the code are 
indexed by the elements of an index set /; unless specified otherwise, I = [n]. Given a linear code C 
with index set /, for J = {j\ , j'2 , . . . , j s } C I, the shortening of C to the coordinates in J is denoted Cj 
and defined as follows: 

Cj = {cjiCjj . . . Cj B : cic 2 . . . c n € C, Ci = for i J}. 
The notions of treewidth and trelliswidth are central to this article, and we define these next. 

2.1 Treewidth and trelliswidth 

For brevity, we provide only the necessary definitions and main results; for details, see Bl. lllOl . 

A tree is a connected graph with no cycles. The set of nodes and the set of edges of a tree T are 
denoted by V(T) and E(T), respectively. Degree-1 nodes in a tree are called leaves, and all other nodes 
are called internal nodes. We let L(T) denote the set of leaves of T. A tree is a path if all its internal 
nodes have degree 2; and is a cubic tree if all its internal nodes have degree 3. A path with at least one 
edge has exactly two leaves; a cubic tree with n leaves has n — 2 internal nodes. 

Let C be an (n, k) linear code with index set /. A tree decomposition of C is a pair (T, uj), where T 
is a tree and uj : I — > V(T) is an assignment of coordinates of C to the nodes of T. 

Given a tree decomposition (T,uj) of C, for each node v of T, we define a quantity k v as follows. 
Let E(v) denote the set of edges of T incident on v. For e € E(v), let T e „ denote the component of 



T — e (T with e removed) not containing v. Finally, let I ev = uj 1 (V(T e>v )) be the set of coordinates 
of C that are assigned to nodes in T ejV . Then, 

Kv = k- ^ dim(Cj e J. (1) 

e£E(v) 

The quantity k v above is the dimension of the local constraint code at node v in the minimal realization 
of Con (T,w), denoted by M(C;T,uj). 

Let k(C:T,uj) = max k v denote the constraint complexity of M.(C:T, uj). The treewidth of a 

vev(T) 

code C, denoted by k(C), is then defined as 

k(C) = min k(C;T, uj). (2) 

It is, in fact, enough to perform the minimization in © over cubic trees T with n leaves, and mappings 
uj that are bijections between I and L(T). 

The trelliswidth of C, which we will denote by r(C), can be defined using the above notation as 
follows: 

t(C) = min k(C; P, it), (3) 

7T 

where P is the path on n nodes, and the minimization is over mappings tt that are bijections between / 
and V(P). From © and ©, it is clear that n(C) < r(C). 

Let ui , t»2 , . . ■ , v n be the nodes of the path P, listed in order from one leaf to the other. For the 
bijection tt : I — > V(P) that maps z to Uj (1 < i < n), we obtain from (fl]), 

K Vi = k- dim^ji^-i]) - dim(C 7r[i+ i in ]), (4) 

where %[a, b] = {ir(j) : a < j < b}. 

2.2 Generalized Hamming weights 

The generalized Hamming weights of a linear - code, introduced and studied in lfI31 . limit the possible 
dimensions of shortened versions of the code. So, they are related to the complexity of tree realizations 
in a natural way. 

Let C be an (n, k) linear code with index set /. We will use the notation V C C to say that V is a 
subcode of C. For a subcode V C C, we define its support x(^) = : 3 C1C2 . . . c n ET> s.t. q ^ 0}. 
The p-th generalized Hamming weight of C, denoted d p {C), is the size of the smallest support of a 
p-dimensional subcode of C, i.e., d p (C) = min{|x(P)| : T> C C,dim(P) = p} for 1 < p < k. It is 
known that < d\{C) < di{C) < ■ ■ ■ < dt{C) < n. Also, d\{C) is the minimum distance of C. 

A closely related definition is that of maximal limited-support subcode dimensions. For 1 < s < 
n, U S (C) is defined to be the maximum dimension of a subcode of C with support at most s, i.e., 
U S (C) = max{dim(D) : V C C, \x{^)\ < s }- The maximal limited-support subcode dimensions can 
be computed using the generalized Hamming weights as follows: 

U S (C) = u such that d u (C) <s< d u+1 (C) (5) 

with the convention that do(C) = and dt+i{C) = n + 1. We also define Uq(C) = 0. 

3 The Proof Strategy 

From the relevant definitions, treewidth cannot exceed trelliswidth for any code C, i.e., k(C) < t(C). 
We now describe a general strategy that can be used to show the opposite inequality in certain cases. 



Consider an (n, A;) linear code C, with index set /. The idea of using maximal limited-support 
subcode dimensions to study the complexity of trellis realizations of C was introduced in J9). We extend 
that idea to tree realizations here. For J C I, Cj is a subcode of C with support at most |J|. So, 
dim(Cj) < U\j\{C). Therefore, given any tree decomposition (T,lo) of C, we obtain from dH) that for 

any v eV(T), 

K v >k- U\l.A C )- ( 6 ) 

Now, recall from the definition of treewidth that it suffices to carry out the minimization in © over 
tree decompositions (T, lo) in which T is a cubic tree with n leaves, and w is a bijection between / and 
L(T). For such a (T, lo), we note that | I ejV | is simply the number of leaves in T ejV , and for an internal 
node v G V(T), the summation in © contains exactly three terms. 

Let n e ^ denote the number of leaves in T e i) , and note that these numbers n e _ v are determined purely 
by the topology of T. At an internal node v in a cubic tree T with n leaves, we will list the edges in E{v ) 
in the form of an ordered triple [e\{v) e2(v) ez(v)] such that 1 < n ei t v \ v < n e2 / v \ v < n e3 r v \ v < n. If 
the node v is clear in the context, we will use the simplified notation rij = n e .( v ^ v for i = 1,2, 3. 

Suppose that T is a cubic tree with n leaves having an internal node v such that the numbers 
ni,?i2,n3 satisfy Ylf=i Um(C) < k — t{C). Then, by ©, for any bijection lo between / and L(T), 
we have k v > r(C), and hence k(C; T, oj) > r(C). Consequently, if every cubic tree with n leaves had 
such a node v, then we would have k(C) > t(C). Since the opposite inequality is always true, we have 
proved the following proposition. 

Proposition 1. Let C be an (n, k) linear code with the property that for any cubic tree T with n leaves, 
there always exists an internal node v G V(T) such that Yli=i U ni (C) < k — r(C), where m = n e .r v \ v . 
Then, k(C) = r(C). 

A comment on the proof strategy implied by Proposition [T] is in order. To show that k{C) > r(C) 
(and hence, k(C) = r(C)), the obvious strategy would be to show, for each tree decomposition (T, lo) 
of C, the existence of a node v G V(T) for which k v > t(C), where k v is given by ©. In general, the 
node v would depend on the tree T as well as on the coordinate assignment lo. However, in the proof 
method based upon Proposition [T] the idea is to find, for a given (T, lo), a node v G V(T) that depends 
only on the topology of T, and thus, is independent of lo, for which k v > r(C) holds. It is a remarkable 
fact that this proof strategy can be made to work for MDS and Reed-Muller codes, as we will see in 
Sections Hand [5j 

The hypothesis of Proposition Q] requires the existence of a node in any cubic tree, whose removal 
partitions the tree into components with a certain property. The property in this case is that the cor- 
responding partition of the number of leaves, n, into 711,712,713 satisfies ^£=1 U n .i(C) < k — t(C). 
Structural results of this form are known as separator theorems (see e.g., |[T3lO 

A classical separator theorem is a theorem of Jordan [|8 | that states that any tree on 71 nodes has 
an internal node whose removal leaves behind connected components with at most n/2 nodes each. A 
trivial modification of the simple proof of this theorem shows that the two occurrences of "nodes" in the 
theorem statement can be replaced by "leaves". For easy reference, we record this as a proposition for 
the special case of cubic trees. 

Proposition 2. In any cubic tree with 71 > 3 leaves, there exists an internal node v such that n e .i v \ v < 
n/2 fori = 1,2,3. 

Another classical (edge) separator theorem is the following result (cf. |[T3l ): every cubic tree T with 
71 leaves contains an edge e such that both components of T — e have at most 2n/3 leaves. Now, one 
of these two components must have at least n/2 leaves; let v be the node incident with e for which this 
component is T e<u . Then, for this v, we have 713 G [n/2, 2n/3]. We record this fact below. 

Proposition 3. In any cubic tree with 71 > 3 leaves, there exists an internal node v such that n e3 r v \ v G 
[n/2,2n/3]. 



As we will see in the next two sections, Propositions [2] and [3] allow us to deal with MDS and Reed- 
Muller codes, respectively. We consider MDS codes first. 



4 Treewidth of MDS Codes 

MDS codes are (n, k) linear codes for which the minimum distance equals n — k + 1. Basic facts about 
MDS codes can be found in lfT2ll . 

Let C be an (n, k) MDS code, with index set I = [n]. The generalized Hamming weights of C were 
computed in |[T5l as follows: 

d p (C) = n — k + p, 1 < p < k. 

From this, the maximal limited-support subcode dimensions, U S (C) for 1 < s < n, can be determined 
using ©. They are given by 

C/,(C) = {°' 1 (7) 
[q, s = n — k + q, q = 1,2, • • • , k. 

Equivalently, U S (C) = max{0, s — (n — k)}. We use this to compute r(C) next. 

Let H be a parity-check matrix for C. For a subset J C I, the code Cj has dimension equal to 
| J\ — rank(H\j), where H\j refers to the restriction of H to the columns indexed by J. As C is MDS, 
rank(f/~|j) = min{|J|,n — k}. Hence, dim(Cj) = max{0, \ J\ — (n — k)} = U\j\(C). Therefore, for 
any permutation it of /, we have for integers 1 < a < b < n, dim(C 7r [ a >b j ) = C4_ a+ i(C). Therefore, 
the right-hand-side of © is always equal to k — Ui-\{C) — U n -i(C). It follows directly from this that 

r(C) = max (k - U^C) - U n ^{C)) = k - min (C) + U n ^{C)). 

l<i<n l<i<n 

A straightforward computation using (|7) yields 

. N ( I 0, if n — k > k, 

min (Ui-t (C) + C^_i(C = ' " ' 

i<i<n l2A;-n — 1, if n - k < k. 

achieved for i = n — k + 1. We thus have the following result. 

Proposition 4. The trelliswidth of an (n, A;) MDS code C is given by t(C) = min{A, n — k + 1}. 
With this, we have 

k - t(C) = max{0, 2k - n - 1}. (8) 
We can now prove that the treewidth of an MDS code equals its trelliswidth. 

Theorem 5. For an (n, k) MDS code C, we have 

k(C) = t(C) = min{/c, n — k + 1}. 



Proof. The statement is trivial for n = 1, 2, or when k = n, so we assume n > 3 and 1 < n — fe. Let 
T be a cubic tree with n leaves, and let v be the node guaranteed by Proposition [2] We will show that v 
satisfies the hypothesis of Proposition [T] 

Set ni = n e .r v \ v , i = 1,2, 3, and recall that, by definition, n\ < ?i2 < ?^3- By choice of v, we also 
have n.j < n/2 for i = 1,2, 3. For convenience, we write U Hi for U ni (C). 

Case 1: n — k > k . 

In this case, n^ < n/2 < n - k, so that £\ f7 ni = by ©. Moreover, by ®, A; - r(C) = 0. 



Case 2: 1 < n - k < k . 

Now, we have rii < n/2 < k. We must show that J2i U ni < 2k — n — \. If 723 < n — k, then 
Y2i U ni = 0- So, we assume 77,3 = k — 5, with 1 < 5 < 2k — n. Then, U m = n^ — {n — k) = 2k — n — 5 
and n\ + ri2 = n — 723 = ri — k + <5. So, we have 

f^m + ^n 2 + U na = max{0, k — n + rii} + max{0, k — n + 722} + 2/c — n — 5 

< max{0, k — n + ni, k — 11 + 712, 2k — 2n + n± + 712} + 2k — n — 5 
= max{2/s — n — 5, 3k — 2n + 722 — 5, 3k — 2n} 

< 2k-n-l, 

where the last inequality holds because 5 > 1, 122 < n — k + 5 — 1 and n — k > 1. 

Thus, in both cases, we see that J2i U n , < k — r(C), and so, by Proposition [Q we have k{C) = 
t(C). □ 



5 Reed-Muller codes 

For a positive integer m and a non-negative integer r with < r < m, the r-th order binary Reed- 
Muller code of length 2 m , denoted RM(r,m), is defined as follows. Let P™ denote the set of all 
Boolean polynomials in m variables of degree less than or equal to r. For an integer i, < i < 2 m — 1, 
with binary expansion i = Y^j=o bj(i)2^, bj(i) G {0,1}, we let b(i) = (bo(i), • • • , 6 m _i(i)). 
For / € P r m , let /(b(i)) = f(b (i),h(i),--- ,b m -i\i)). The code RM(r, m) is defined as 

RM(r,m) = {[/(b(0))/(b(l)) ••• /(b(2 m - 1))] : / G P r m }- (9) 

The code RM(r, m) has length n = 2 m , dimension fc(r, m) = Sj=o (7)' an( ^ m i nrmum distance 2 m ~ r 
lTT2l . In (O, the order of evaluation of the function / is according to the index set / = [0, 2 m — 1]. This 
is called the standard bit order. 

We will denote the treewidth and trelliswidth of RM(r, m) by k(t, m) and r(r, m), respectively. 



5.1 Trelliswidth of RM(r, m) 

Let C be the Reed-Muller code RM(r, m) in the standard bit order, so that / = [0, 2 m — 1]. In this 
section, we derive an exact expression for the trelliswidth of C. 

Let P be the path on n = 2 m nodes, with vo,v±,..., v n ^\ being the nodes of P, listed in order 
from one leaf to the other. For any 7r : / — s> V{P), we obtain from dU), in a manner analogous to the 
derivation of ©, 

k Vi > k(r,m) - Ui(C) - U n -i-i(C), 

for i = 0, 1, . . . , n — 1. Thus, 

k{C;P,tt) > k{r,m) - min (Ui(C) + C/„_i_i(C)). (10) 

0<i<n— 1 

Note that the right-hand-side is independent of n, so that by ([3]), 

r(C) > k(r,m) - min (^(C) + C/ re _i_i(C)). (11) 

0<t<n— 1 

It is shown in that for RM(r, m) in the standard bit order, we have for i = 0, 1, . . . , n — 1, 

dim(C [0!i] )) = Z7i+i(C) and dim^ ,„„!]) = U n -i(C). (12) 

It follows that when ir simply maps i to vi for all i G /, then we have equality in (fTOl) . and hence, in (fTTI) . 
To put this another way, the branch complexity of the minimal trellis representation of RM(r, m) in the 
standard bit order attains the lower bound on, and thus equals, the trelliswidth of the code. Techniques 
from 13 allow us to compute, with very little effort, the branch complexity of this trellis representation. 
We give the details of this computation in Appendix A. From this, we obtain the following result. 



Proposition 6. The trelliswidth of the Reed-Muller code RM(r, to) is given by 

X^oC";^ 1 ) ifm>2r + l, 

^m—r—l /m—2j- 
■*j=0 \ r—j 



r(r, to) 



1 + E, m =o r_1 {^r 1 ) ifm<2r + l. 



Recall that the dimension of the code RM(r, m) is given by k(r, m) = Ej=o {■)• We w ^ ^ n( ^ 
it convenient to define k(r',m') to be Ej=o ("j ) f° r ai l non-negative integers r',m', including when 
r' > m' . with the usual conventions that ([j) = 1 and (™ ) =0 for j > to'. Thus, for r' > m' > 0, 
k(r',m') = 2 m ' . Following these conventions, we give an expression for the difference k(r,m) — 
r(r, to). 

Proposition 7. For the Reed-Muller code RM(r, m), we have 

min{2(r— l),m— 1} 

k(r, to) — r(r, to) = k{r — 1 — [«/2] , m — 1 — i). 

We present the algebraic manipulations required to prove this proposition in Appendix A. 

It is instructive to explicitly write out some of the terms of the summation in the last proposition. 
When to > 2r, we have 

k(r, to) — r(r, to) = k(r — 1, m — 1) + k{r — 2, to — 2) + k{r — 2, to — 3) 

+ k(r — 3, to — 4) + /c(r — 3, to — 5) 
+ h /c(0, to - 2r + 2) + k(0, to - 2r + 1), (13) 

and when to < 2r — 1, we have 

k(r, to) — r(r, to) = /c(r — 1, to — 1) + k{r — 2, to — 2) + /c(r — 2, to — 3) 

+ k(r — 3, to — 4) + /c(r — 3, to — 5) 
+ ... + k(r-l-\*i=Z],l) + k(r-l-\&=±],0). (14) 

5.2 Treewidth of RM(r, m) 

We state below our main result showing that the treewidth of a Reed-Muller code equals its trelliswidth. 
Theorem 8. The treewidth of the Reed-Muller code RM(r, to) is given by 

E^or^T 1 ) ifm>2r + l, 

^m—r—l Im—lj— 
jj=0 \ r—j 



n{r, to) = r(r, to) 



The rest of this section is devoted to a proof of the above result, which follows the strategy outlined 
in Section [3] Some of the technical details of the proof are presented in Appendices B and C. 

Let RM(r, m) be given. If to < 2, or r = to, then RM(r, m) is an MDS code, which has been dealt 
with in Section |U Henceforth, we will assume to > 3 and r < to — 1. 

Let T be a cubic tree with n = 2 m leaves, to > 3, and let W = {v € V(T) : n e3 r v \ v € 
[n/2, 2n/3]}. By Proposition |3l W is non-empty. Let v* G W be a node that achieves maxln^^ „ : 
v G VF}. Write n* = n e .( v *^ v *, i = 1, 2, 3. 

Lemma 9. We have n/6 < n\ < n/3. 




;— □ 



•—a 



>n% 



"3 



Figure 1: For v, we have = n\ + ra|. 



Proo/ If n* 2 < n/6, then from the fact that n* < n|, we obtain + < n/3, so that > 2n/3, a 
contradiction. So, 122 > n/6. However, n/6 is not an integer for n = 2 m , and so, n\ > n/6. 

If n\ > n/3, then n^ + n^ < 2n/3. Let « be the neighbour of v* incident with edge e2{v*). Then, 
setting n3 = n es ( v ^ v , we see that = n\ + see Figure [T] But this means that n^ < n% < 2n/3, 



which contradicts our choice of v*. 



□ 



We will show that Ylf=i U n * < k(r, m) 



t r, to 



), which will prove Theorem [8]by virtue of Propo- 



sition [TJ Here, and in all that follows, we use Uh as shorthand for C/^(RM(r, to)), 

Denote by ft( m ) and the largest integers in [0, 2n/3] and [0, n/3], respectively. Explicitly, 



and 



Equivalently, in binary form, 



(m) 



(m) 



3 ' Z 




if to is odd , 
if to is even, 

if to is odd , 
if to is even. 



(15) 



(16) 



b(a(' m )) 



(1,0,1,0,1, - ■■ ,0,1) if to is odd, 
(0, 1, 0, 1, • • • ,0, 1) if to is even, 



(17) 



and 



m) * 



(0,1,0,1,0, 
(1,0, 1,0,-- 



• ,1,0) 

1,0) 



if to is odd, 
if to is even. 



(18) 



When there is no ambiguity, we will drop the superscripts from «( m ) and /3( m ) for notational ease. 

Now, what we know is that n% G [2 m_1 , ft] and n| 6 2 m ] , 0\. In fact, it can be directly verified 
from the expression for a that [~i 2 m ] = ft — 2 m_1 + 1. We wish to show that U n * < k(r, to) — 
r(r, to). We will do this in two steps: first, we show in Lemma [TOlbelow that ^ U n * < U a + Up + U\, 
and then, we prove in Lemma [TT1 that U a + Up + U\ = k(r, to) — r(r, to). 

Write n?5 = ft — i and n?j = /3 — j, so that nj = 2 m — (ng + n^) = i+ j + 1, where i € [0, a — 2 m_1 ] 
and j e [0, /3 - (ft - 2 m " 1 + 1)]. The following lemma shows that £ I7 n . < C/ a + + C/i. 



Lemma 10. For ie[0,a- 2 m ~ 1 ], and j e [0, - (a - 2 rn ~ l + 1)], we have 

(U a - U a -i) + (Up - Up_j) > U t+j+1 - Ul 

Proof. See Appendix B. □ 
Lemma 11. U a + Up + U\ = k(r, m) — r(r, m). 

Proof. The minimum distance of RM(r, m) is 2 m ~ r . Since we have assumed r < m — 1, the minimum 
distance is at least 2, and hence, U\ = 0. In Appendix C, we show the following: when m > 2r, 



Ei=o k(r-l-i,m-l-2i) if . 



a, 



U s = I ^'i (19) 
\E*Ii%-l-*»w-2t) if s = /?. 

Examining the above summations tenn-by-term, it may be verified that the alternate terms on the right- 
hand side of (fT3l) . beginning with k(r — 1, m — 1), sum to U a , while the remaining terms sum to Up. 
Hence, when m > 2r, the statement of the lemma holds. 
When m < 2r, we show in Appendix C that 

m — 1 

. y~l- n fefr — 1 — i, m — 1 — 2i) if m is odd, 
C/ a = < (20) 



Ej=o — 1 — i,m — 1 — 2i) if m is even. 



and 



U = J ^*=i K r - l -h m ~ 2i ) if mis odd, 
I Ej=i — ^ — i,m — 2i) if m is even. 

This time, it can be seen that the alternate terms on the right-hand side of (fl4l) . beginning with k(r — 
1, m — 1), sum to while the remaining terms sum to Up. This completes the proof of the lemma. □ 

With this, the proof of Theorem |8]is complete. 



6 Concluding Remarks 

In this paper, we proved the surprising fact that for the families of MDS and Reed-Muller codes, if we 
use the maximum dimension of local constraint codes to measure the complexity of a graphical realiza- 
tion, then there is no advantage to be gained in going from trellis realizations to cycle-free realizations 
on more complex tree topologies. This is particularly surprising for Reed-Muller codes, given that they 
have a natural binary-tree structure arising from the recursive \u\u + v\ construction (see e.g. [|5l). Of 
course, the situation could be different if we used some other measure for the complexity of a graphical 
realization, for example, the sum of the local constraint dimensions. 

It is also quite remarkable that the proof strategy outlined in Section [3]- namely, identifying in any 
cubic tree T a node v £ V(T) such that k v > r(C) for every tree decomposition of the code C on 
T — succeeds for MDS and Reed-Muller codes. As noted in that section, this strategy ignores the 
role played by the coordinate assignment u in determining the local constraint code dimension, k v . It 
seems unlikely that this method of proof would succeed for other code families. It would of course 
be interesting to devise a set of tools that could be used to compute treewidth, or simply to determine 
whether or not treewidth can be strictly less than trelliswidth, for other families of algebraic codes. 



Appendix A: Proofs of Propositions |6] and 



In this appendix, we compute the branch complexity of the minimal trellis representation of RM(r, m) 
in the standard bit order, from which the expressions in Proposition [6] and [7J are obtained. We refer the 
reader to the survey by Vardy frPfl for the necessary background on the theory of trellis representations. 

Let r(r, m) and a(r, m) denote, respectively, the branch complexity and state complexity of the 
minimal trellis representation of RM(r,m) in the standard bit order. Berger and Be'ery gave an 
explicit expression for a(r, m): 

min{r,m— r— 1} 



<j(r,m) = [ r - i )' 



A different derivation of the above was given by Blackmore and Norton Q. We rely heavily on tools 
from [ 2 ] to prove the following result, which is equivalent to Proposition [6] 

Proposition 12. 

a(r, m) ifm > 2r + 1, 



r(r, m) 



<r(r, m) + 1 m < 2r + 1. 



We introduce some terminology and notation that will be needed in the proof of the proposition. Let 
C be the code RM(r, m) in the standard bit order, and let n = 2 m . Let Tbe the minimal trellis of C. For 
i = 0, 1, . . . , n, the dimension of the state space at depth i in T is denoted crj. Thus, a(r, m) = maxj a{. 
For i = 0, 1, . . . , n — 1, we denote by Tj the dimension of the branch space between the state spaces at 
depths i and i + then, r(r, m) = maxj Tj. 

The following definitions were made in for < i < n — 1 : 

(a) if dim(C[j_|_i n _i]) = dim(Cfj_i :re _i|) — 1, then % is called a point of gain of C; and 

(b) if dim(Cr 0) i]) = dim(C[ .j_i]) + 1, then % is called a point of fall of C. 

As per our notation from Section|5] h(i) denotes the m-bit binary representation of i, < i < n — 1. 
Let |b(z)|o and |b(i)|i denote the number of 0s and Is, respectively, in b(i). 

Lemma 13 ((H, Proposition 2.2). For < i < n - 1, 

(a) i is a point of gain ofC iff |b(i)|i < r; 

(b) i is a point of fall ofC iff |b(i) |q < r. 



Proof of Proposition [72] It is a fact that for any minimal trellis representation, branch complexity 
either is equal to the state complexity or is exactly one more than the state complexity. In particular, 
a(r, m) < r(r, m) < a(r, m) + 1. So, to prove Proposition [T2l it suffices to show that 

r(r, m) = a(r, m) + 1 iff m < 2r. (22) 

Suppose that r(r, m) = T{ for some % G [0, n — 1]. From the local behaviour of T described in ||2] 
p. 44], it follows that we can have Tj = a(r, m) + 1 iff <jj = a(r, m) and i + 1 is a point of gain as well 
as a point of fall of C. 

Thus, if Tj = a(r, m) + 1, then by Lemma [T3l m = \h{i + l)|i + \h{i + l)|o < 2r. This proves the 
"only if" direction of (l22l . 

Conversely, suppose m < 2r. The proposition is clearly true if m = r, since RM(m, m) = 
{0, l} 2 ™ , and we have a(m, m) = and r(m, m) = 1. So, we may assume m > r + 1. Take z to be 



such that b(i) = (0, . . . , 0, 1, 0, 1, 0, . . . ,1,0), with |b(i)|i = m -r - 1. Then, by Theorem 2.11 in 
O, cr £ = a(r,m). Also, b(z + l) = (1, 0, . . . , 0, 1, 0, 1, 0, . . . , 1, 0), with |b(i + l)|i = m - r < r and 
\h(i + 1 ) | o = m — (m — r) = r. Hence, by Lemma [T3l i + 1 is a point of gain as well as a point of fall 
of C. Hence, r, = a(r, m) + 1, which completes the proof of (l22l . and hence, of Proposition [T2l □ 

We next present the algebraic manipulations needed to prove Proposition [7] 

Proof of Proposition^ We divide the proof into three cases. 
Case 1: m > 2r + 1. We have 



k(r, m) — r(r, m) 



E 

r 

E 

3=0 
r 

E 



E 

3=0 



E 

3=0 



m — 2 j — 1 
r _ j 

m — 2(r — j) — 1 
j 



m — 2(r — j) — 1 



(a) 



(6) 



m — 1 — i 
J-l 

m — 1 — i 

i-i 



r 2(r-j) 

E E 

j=l i=0 
2(r-l)r-rt/2l 

E E 

i=0 j=l 
2(r-l) 

= V fc(r - 1 - [i/2],m - 1 -«')■ 

i=0 

In the above chain of equalities, equality (a) uses the fact that for integers a < b and j > 1, we 
have Q) - Q = Z b q =l this is just repeated application of the identity (*) = (J^) + ( b ^ 1 ) . 

Equality (b) is obtained by exchanging the order of the summations in i and j. 

Case 2: m = 2r . Here, 



k(r, m) — r(r, m) 



E 

i=o 

r 

E 

r 

E 

3=1 



m — 2 j — 1 
r-j 



r-l 

i-E 

3=0 

— 2(r — j) — 1 



E 

3=1 



m — 2(r — j) — 1 
3 



and now we carry on from equality (a) of Case 1. 



Case 3: m < 2r — 1. This is the most tedious case. We start with 



k(r, to) — r(r, to) 



E 

j=0 



m— i — 1 



i- E 

i=o 



E 7 - E 



to — 2j — 1 
r _ j 

to — 2(r — j) — 1 
J 



2r— m 



E 7 + E 



777 

. 7 

j=l w / j=2r-m+l L w 
2r-m / s. r 2(r-j) 

E(™)+ E E 

j=l VJ/ j=2r-m+l i=0 
2r-m / \ 2(m-r-l) r-fi/2] 

E 7 + E E 



to — 2(r — j) — 1 
J 



m — 1 — i 
3-1 



j=l i=0 j=2r— m+1 

m—1 /to— 1— 



m — 1 — i 
3-1 



Now, for j > 1, write (7) = (™) - (J) = ET=o C7-f )" Hence ' 



Also, 



2) — m 

E 

j"=i 



2(m-r-l) r-\i/2] 

E E 

i=0 j=2r-m+l 



m—1 2) — m 

EE 

i=0 j=l 



m — 1 — i 
3-1 



to — 1 — i 
3-1 



m-l r-\i/2] 

E E 

i=0 j=2r—m+l 



m — 1 — i 
3-1 



(23) 



(24) 



(25) 



as when i > 2(m — r — 1) + 1, we have r — [i/2] < 2r — to, so that the inner summation Y^j=2r-m+\ 



is empty. Plugging (1241) and (1251) into (1231) . we find that 



fc(r, to) — r(r, to) = 

i=o j=i 

This completes the proof of Proposition |7] 



m — 1 — i 
3-1 



m—1 



k(r ~ 1 ~ \i/2],m - 1 



1 ■ 



i=0 



□ 



Appendix B: Proof of Lemma [TOl 

We recast the statement of Lemma [TO] into an equivalent statement about binary representations of 
integers. From (fl2l and the notion of points of fall from [j2] (see Appendix A), we see that for 1 < 
s < 2 m , U s is equal to the number of points of fall of RM(r, m) within the interval [0, s — 1]. Thus, by 
Lemma [T3l U s is equal to the number of integers in [0, s — 1] whose m-bit binary representations have 
at least m — r Is. 

For an integer j G [0, 2 m — 1], let wt(j) denote the Hamming weight of (i.e., the number of Is in) 
the binary representation h(j). For a subset S C [0, 2 m — 1], let Wi(S) denote the number of integers 
j € S with wt(j) > i. We set Wi{%) = 0. Then, LemmafTOlis equivalent to the following assertion: for 

i E [0, a - 2 m " 1 ] and j e [0, p - (a - 2 m " 1 + 1)], we have 

w m - r ([a -i,a-l]) + w m - r (\j3 - j,f3 - 1]) > w m - r ([l, i + j]). (26) 

Since Lemma [TOl needs to be shown for any RM(r, to) with < r < m — 1, we see that (1261 ) must be 
shown for any m — r G {1, 2, . . . , m}. With this in mind, we define for S C [0, 2 rn — 1], 

wH(5) = \w x {S) w 2 (S) ••• w m (S)]. 



As usual, we will drop the superscript (m) when it can be gleaned unambiguously from the context. 
Proposition 14. For m > 2 and < i, j < 

a (m) _ ymr-\ t we 

have 

w M([ a M _f )a M _i]) +w M([/jM -j,/3M - l]) > w ( m )([l,i + j]), (27) 
w/f/i inequality above holding componentwise. 

Observe that this proposition is slightly stronger than Lemma [TOl since the latter only requires 
< j < P {m) - (oW - 2™" 1 + 1). It is easy to verify that ^ - (a^ - 2 m " 1 + 1) < - 2 m ~ 1 . 
The remainder of this appendix is devoted to a proof of Proposition [14] The proof is by induction on 
m, which is why we have taken care to include the superscripts on a and (3 in the statement of the 
proposition. The main ingredients in the inductive proof are the simple facts that for a non-negative 
integer j, wt(2j) = wt(j) and wt(2j + 1) = wt(j) + 1. The rest is merely careful bookkeeping. 

Let p( m \i,j) denote the inequality in (|27"T ). The induction argument is built upon certain implica- 
tions among the P^ m \i,j), as stated in the series of lemmas below. We introduce here some notation 
that we will use in the proofs of these lemmas. For a set of integers 5, we write 25 and 25 + 1 to mean 
the sets {2j : j € 5} and {2j + 1 : j € 5}, respectively. By with 1 < a < b < m, we mean 

the vector [z\ Z2 ■ ■ ■ z m ], with Zi = 1 for a < i < b, and Zi = otherwise. Again, we will drop the 
superscript (m) when there is no ambiguity. 

Lemma 15. For even m, P {m) (i,j) implies p( m+1 )(2i + l,2j). For odd m, P {m) (i,j) implies 
P( m+1 )(2i,2j + 1). 

Proof. For even m, we have «( m+1 ) = 2a^ + 1, and /3( m+1 ) = 2^ m \ Set 5 = [aW - i, - 1] 
and T = - j, ^ - 1]. Now, P( m \i,j) implies 

w H+i)(25) + w ( m+1 )(2r) > w (^+i)( 2 [l,i + j]) (28) 
w (™+i)(2S + i) +w (™+i)(2T + l) > w( ,n+1 )(2[M+j] + l) (29) 

since wt(2j) = wt(j') and wt(2j + 1) = wt(j') + 1 for any non-negative integer j. Henceforth, all the 
w's in this proof are w( m+1 ) 's. Combining (l28l) and (l29l . we have 

w([2a( m ) - 2i, 2a^ - 1]) + w([2/?( m ) - 2j, 2/3^ - 1]) > w([2, 2i + 2j + 1]), 

which is the same as 

w ([ a (™+i) _2i-l, a (m+1) -2])+w([/?( m+1 ) -2j,/3( m+1 ) -1]) > w([2,2i + 2j + l]). (30) 

Now, w([l, 2i + 2j + 1]) = w([2, 2i + 2j + 1]) + lf™"!^. Also, w([a( m+1 ) - 2i - 1, a ( m+1 ) - 1]) = 

w ([ a ("i+i) _2i_i, a (™+i) _2]) + l[ 1 m ^ ] ,since wt(a( m+1 ) - 1) =wt(2a( m )) = wt( a ( m )) = m/2, 
by (fTTT ). Therefore, 

w ([ a (™+i) _2i-l, a ( m+1) - 1]) +w([6( m+1) -2j,6( m+1 ) -1]) > w([l,2i + 2j + l]), (31) 

which is P i - m+l \2i + l,2j). 

The proof for odd m is along similar lines. □ 

Lemma 16. (a) When a*™) — i is even, the two inequalities P( m >(i,j) and P^ m \i + 2, j) together 
imply P( m \i + l,j). 

(b) When /?( m ) — j is even, the two inequalities p( m ) (i, j) and _p( m ) (i, j+2) together imply p( m ) (i, j+ 
1)- 



Proof. We only prove (a), as the proof of (b) is completely analogous. In this proof, all omitted super- 
scripts are to be taken to be (m). 

Letx = wt(a— i— 1) andy = wt(i+j+l). We have w([a— i— 1, a— 1]) = w([a— i, a— + 
andw([l,i+ j + 1]) = w([l,i + j]) + l^j. We want to show p( m )(i + l,j): 

w([q - i, a - 1]) + 1 [1)X] + w([/3 - j, /3 - 1]) > w([l, i + j}) + (32) 

If x > y, then p( m \i,j) clearly implies (|32l ). So, suppose x < y. Then, 021 becomes 

w([a - i, a - 1]) + w([/3 - j, /3 - 1]) > w([l, i + j}) + l [x+ i, y] , 

or equivalently, 

rr • m , •« WtM + iD + 1 if* + l</<y 

WiQa-^a-lJJ + wjQ^-^/S-l]) > < . (33) 

\wi([l,i + j\) otherwise. 

Let x 1 = wt(a — i — 2) and y' = wt(i + J + 2). Since a — i is even, we see that x' + 1 = x or 
a/ < x. Now, we have 

w([a - i - 2, a - 1]) = w([a - i, a - 1]) + l[ ljX ] + 1^/] (34) 
w([l,i + j + 2]) = w([M+i]) + l M + l [iy] (35) 

Thus, p( m \i + 2, j) is equivalent to 

w([a 1]) + l [1)!r /] + w([/3 - j,/3 - 1]) > w([l,i + + l[ x +i,y] + ( 36 ) 

Using the fact that x' < x, (l36l ) implies that for x + 1 < I < y, 

wi([a - i, a - 1]) + ru,([/3 - j, /3 - 1]) > wjQl, i + j}) + 1. 

Since p( m ) clearly implies the "otherwise" part of (|33l , we have shown that p( m ) (i, j) and p( m ) (i+ 
2, j) together imply i.e., p( m )(i + 1, j). □ 

Lemma 17. For even m, the following implications hold: 
(a) P {m) (iJ) =► P( m+1 )(2i + l,2j); 
ffe; pH^-lj'jAPH^j) p( m + 1 )(2i,2j); 

(c) pW^jjApH^j + i) =^ p( m+1 )(2i + l,2j + l); 

(d) p( m )(i-l,j) AP( m ')(i,j) APN(i_i )J + i) AF(™'(i,j + l) => P( m+1 )(2i,2j +1). 

Proof, (a) follows directly from Lemma H"5l 

(b) : If PM (i - 1, j) and p( m ) (i, j) are true, then by Lemma[l5] we have p( m + x ) (2i - 1, 2j) and 
p( m+1 )(2i + 1, 2j) being true. Since m + 1 is odd, a (m+1 ' ) is odd (see (fTTT)). It now follows from 
LemmafUta) that p( m+1 )(2i, 2j) holds. 

(c) : This follows by an argument similar to part (b), except that Lemma [ToT b) is applied. 

(d) : By part (b), p( m + x ) (2i, 2j) and p( m + x ) (2i, 2j+2) hold. Therefore, by LemmafTBlb). p( m + x ) (2i, 2j+ 
1) holds. □ 



Arguments similar to those used in the above proof show the next result. 



Lemma 18. For odd m, the following implications hold: 

(a) P( m \i,j) =► P( m+1 )(2i,2j + 1); 

(b) pW^j-ijApH^j) =^ P( m+1 )(2f,2j); 

(c) pW(i,i)APW(i + l,j) =► P( m+1 )(2i + l,2j + 1); 

fjj P( m )(i,j-1) AFH^j) APH(! + I,j-1) APH(i + l,j) p( m+1 )(2i + l,2j). 

We are now in a position to prove Proposition H"4l 

Proof of Proposition^ Set £< m ) = «( m ) - 2 m_1 . We wish to show that for m > 2, P^ m \i,j) holds 
for < i, j < & n \ It is easy to verify this directly for m = 2 and m = 3, so we start the induction by 
assuming that for some odd m > 3, p( m \i,j) holds for < i, j < & n \ 

For odd m, the implications in Lemma [T8l are enough to show that p( m+1 )(i, j) holds for 1 < i < 
2£^ and 1 < j < 2£^ + 1. Note also that for odd m, we have ^ m+1 ) = 2^ m \ as can be verified 
from ([15). Since p( m + x ) (0, 0), p( m + 1 ) (0, 1) and P^ 1 ) (1,0) trivially hold, we have that p( m + 1 ) (j, j) 
holds for < i < ^ m+1 ) and < j < ^ m+1 ' + 1. 

Now, m + 1 is even, and we have shown above that P^ m+1 \i, j) is true for < i < ^( m+1 ) and 
< j < ^( m+1 ) + 1. The implications in Lemma [T71 are then sufficient to show that p( m+2 \i,j) holds 
for 1 < i,j < 2£( m+1 ) + 1. Again, P( m+2 )(0,0), P( OT+2 )(0, 1) and P( m+2 )(1,0) can be seen to hold 
trivially, so p(' m+2 \i,j) in fact holds for < i, j < 2& n+l '> + 1. This completes the induction step, 
since for even m + 1, it follows from CG]> that 6 m+ ^ = 2^( m+1 ) + 1. □ 

As observed earlier, Proposition [improves Lemma HQ] 

Appendix C: Computing U a and Up 

To derive the expressions in (fT9T> — (|2TT) . we make use of © and a result of Wei lfT31 that explicitly 
determines the generalized Hamming weight hierarchy of RM(r,ro). Any non-negative integer u < 
k(r, m) can be uniquely expressed as a sum 

i 

u = y^fc(rj,mj), (37) 
i=i 

where r > r\ > T2 > ■ ■ ■ > Ti > 0, m > mi > ni2 > ■ ■ ■ > rri£ > 0, and for all i, rrn — ri = 
m — r + 1 — i [Q21 Lemma 2]. The above representation is called the (r, m)-canonical representation 
of u. 

Theorem 19 ( ||T5l . Corollary 6). For < u < k(r,m), given the unique (r, m) -canonical representa- 
tion ofu as in 071 ). we have d u (RM(r, m)) = Yli=i 2 mi - 

For convenience, we will henceforth write d u (RM(r, m)) simply as d u . 

Assume that m > 2r. We want to show that (fT9l holds. We will only prove here the result for 
s = a, as the result for s = can be proved analogously. Let u be the integer given by 

r 

u = ^k(r - i,m + 1 - 2i). (38) 

i=i 

Note that the above is the (r, m)-canonical representation of u. By Theorem [T9J we have d^ = 
E*=i 2 m+1 " 2i . In binary form, b(c^) = (0, 0, . . . , 0, 0, 1, 0, 1, . . . , 0, 1), the number of Is in b(dfi) 
being r. Comparing this with the binary form of a given in (fTTT ). it is clear that d& < a. 



Next, write u + 1 as 

r 

u + 1 = fe(r — m + 1 — 2i) + k(0, m — 2r), 

i=l 

using the fact that fc(0, to — 2r) = ( m ~ 2r ) = 1- This is again in (r, m)-canonical form, and hence by 
TheoremQU we have du+i = ELi 2 m+1 ~ 2i +2 m ~ 2r . In binary form, this is b(d &+1 ) = (0, 0, . . . , 0, 1, 
1, 0, 1, . . . , 0, 1), the number of Is here being r + 1. Comparing with (fTTT ). we see that a < e^+i. 

Since d# < a < we have by ©, U a = u. Observe that u as given by d38l ) is precisely equal 

to the claimed value of U a in ( fT9l ). 

Now, assume m < 2r. We wish to show d20l ) and (|2TT) . We sketch the proof for (1211 here; the proof 
for (l20l ) is similar. Set 

{ m-l 
V,_ 2 i k(r — 1 — i,m — 2i) if m is odd, 
Yli=i — ^- — i,m — 2i) if m is even. 

The above is the (r, m) -canonical representation of u, and hence, 

{ m-l 
EiJi 2 m ~ 2i if mis odd, 
— 
X^jLi 2 m if m is even. 

Comparing h(du) with b(/3) given in (fT8T ). it can be seen that d„ < /3. 
The (r, m)-canonical representation of u + 1 is given by 

{ m-l 
Ej= 2 i fe(r-l-i,m-2t) + ife(r-l-2!ci,0) if mis odd, 
EL — l 
Z~2i=i — 1 — ^, m — 2i) + fe(r — y, 1) if m is even. 

Again, can be obtained from Theorem [191 and the subsequent comparison of binary forms shows 
that /3 < du + \. Hence, by ©, we have Up = u, which proves (1211 . 
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